Considering Multiple Options when Interpreting Spoken Utterances

نویسندگان

  • Sarah George
  • Ingrid Zukerman
  • Michael Niemann
  • Yuval Marom
چکیده

We describe Scusi?, a spoken language interpretation mechanism designed to be part of a robot-mounted dialogue system. Scusi?’s interpretation process maps spoken utterances to text, which in turn is parsed and then converted to conceptual graphs. In order to support robust and flexible performance of the dialogue module, Scusi? maintains multiple options at each stage of the interpretation process, and uses maximum posterior probability to rank the (partial) interpretations produced at each stage. The time and space requirements of maintaining multiple options are handled by means of an anytime search algorithm. Our evaluation focuses on the impact of the speech recognizer and the search algorithm on Scusi?’s performance.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Interpreting Pointing Gestures and Spoken Requests - A Probabilistic, Salience-based Approach

We present a probabilistic, salience-based approach to the interpretation of pointing gestures together with spoken utterances. Our mechanism models dependencies between spatial and temporal aspects of gestures and features of utterances. For our evaluation, we collected a corpus of requests which optionally included pointing. Our results show that pointing information improves interpretation a...

متن کامل

Interpreting Unit Segmentation of Conversational Speech in Simultaneous Interpretation Corpus

The speech-to-speech translation system is becoming an important research topic with the progress of the speech and language processing technology. Considering efficiency and the smoothness of the cross-lingual conversation, the simultaneity of the translation processing has a great influence on the performance of the system. This paper describes interpreting unit segmentation of conversational...

متن کامل

Collection of Simultaneous Interpreting Patterns by Using Bilingual Spoken Monologue Corpus

This paper provides an investigation of simultaneous interpreting patterns using a bilingual spoken monologue corpus. 4,578 pairs of English-Japanese aligned utterances in CIAIR simultaneous interpretation database were used. This investigation is the largest scale as the observation of simultaneous interpreting speech. The simultaneous interpreters are required to generate the target speech si...

متن کامل

Open-Vocabulary Retrieval of Spoken Content with Shorter/Longer Queries Considering Word/Subword-based Acoustic Feature Similarity

Acoustic feature similarity between utterances has been shown to be very helpful for spoken term detection using pseudorelevance feedback (PRF) and graph-based re-ranking. Both cases are based on the concept that utterances similar to those utterances with higher relevance scores in acoustic features should have higher scores, while graph-based re-ranking further considers the similarity struct...

متن کامل

Eliminating Traces of Spoken Language Production in Automatic Dialogue Interpreting

This paper focuses on a particular problem of Automatic Dialogue Interpreting. As the input is unplanned spoken language, it contains certain traces of the effort to formulate the utterances (such as false starts, filled pauses, self-repairs or repetitions). In an adequate translation, these traces are obviously to be eliminated. I introduce a pragmatic-based approach to Automatic Dialogue Inte...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007